I have been building scalable data systems for over a decade. Here I am writing about my experiences, and I hope you find these posts useful. Your comments and feedback are greatly appreciated. — Rares Vernica

Dual-Engine Serverless SQL Lakehouse

Powerful lakehouse with DuckDB, Neon PostgreSQL, and Google Cloud Storage 🌊🦆🐘

The concept of a data lakehouse is gaining serious traction, and for good reason! It combines the flexibility and cost-effectiveness of a data lake with the data management features and ACID transactions of a data warehouse. Let's dive into building a practical lakehouse setup using some fantastic open-source and cloud-native... [Read More]

Machine Learning in SciDB

Leveraging the SciDB Stream plug-in

Popular data processing platforms let users inject an external process into the data processing pipeline. Data flowing through the pipeline is fed as input to the external process, and the output the process produces is fed back into the pipeline. The external process runs... [Read More]

Extending SciDB - Part 1

Documentation and Development Tools

One of the strengths of SciDB over other database management systems is its extensibility. SciDB allows the user to add new data types, functions, and operators. In this multi-part post, we discuss various aspects of extending SciDB. In this post we look at the available documentation and how to set up... [Read More]